Average AoI Minimization for Energy Harvesting Relay-aided Status Update Network Using Deep Reinforcement Learning
نویسندگان
چکیده
A dual-hop status update system aided by energy-harvesting (EH) relays with finite data and energy buffers is studied in this work. To achieve timely updates, the best should be selected to minimize average age of information (AoI), which a recently proposed metric evaluate freshness. The AoI minimization can formulated as Markov decision process (MDP), but state space for capturing channel buffer evolution grows exponentially number relays, leading high solution complexity. We propose relay selection (RS) scheme based on deep reinforcement learning (DRL) according instantaneous packet freshness each relay. Simulation results show significant improvement DRL-based RS over state-of-art approaches.
منابع مشابه
Dueling Network Architectures for Deep Reinforcement Learning
In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning inspired by advantage learning. Our dueling architecture represents two ...
متن کاملRegret Minimization for Partially Observable Deep Reinforcement Learning
Deep reinforcement learning algorithms that estimate state and state-action value functions have been shown to be effective in a variety of challenging domains, including learning control strategies from raw image pixels. However, algorithms that estimate state and state-action value functions typically assume a fully observed state and must compensate for partial or non-Markovian observations ...
متن کاملOn-line Building Energy Optimization using Deep Reinforcement Learning
Unprecedented high volumes of data are becoming available with the growth of the advanced metering infrastructure. These are expected to benefit planning and operation of the future power system, and to help the customers transition from a passive to an active role. In this paper, we explore for the first time in the smart grid context the benefits of using Deep Reinforcement Learning, a hybrid...
متن کاملReinforcement Learning with Deep Energy-Based Policies
We propose a method for learning expressive energy-based policies for continuous states and actions, which has been feasible only in tabular domains before. We apply our method to learning maximum entropy policies, resulting into a new algorithm, called soft Q-learning, that expresses the optimal policy via a Boltzmann distribution. We use the recently proposed amortized Stein variational gradi...
متن کاملDeep Reinforcement Learning Solutions for Energy Microgrids Management
This paper addresses the problem of efficiently operating the storage devices in an electricity microgrid featuring photovoltaic (PV) panels with both shortand long-term storage capacities. The problem of optimally activating the storage devices is formulated as a sequential decision making problem under uncertainty where, at every time-step, the uncertainty comes from the lack of knowledge abo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Wireless Communications Letters
سال: 2023
ISSN: ['2162-2337', '2162-2345']
DOI: https://doi.org/10.1109/lwc.2023.3278864